Set Functionality over DHT Systems

نویسندگان

  • Nuno Lopes
  • Carlos Baquero
چکیده

For the last few years, Peer-to-Peer systems were created with the purpose of distributing information across a very large set of hosts. The scalability demands of such systems lead to the design of Distributed Hash Tables algorithms[1, 2, 3, 4], which are capable of locating an object anywhere on the system given a key value using logarithmic (or constant) message hops while maintaining logarithmic state at each host. Nonetheless, DHTs are not able to search for objects but only to retrieve them using keys. Although DHT algorithms locate data objects efficiently, they do not provide any data consistency when using replication. Data replication is made on a “best-effort” way in which data may become inconsistent or even lost. An atomic data access DHT extension which maintained replica consistency using a state machine replication technique was proposed in [5] but required the system to have a stable host membership in order to progress. This requirement is not feasible on wide area systems, like the Internet, where hosts have small uptimes and are under constant churn[6]. Another protocol was proposed in [7] for supporting atomic mutable data in DHTs which is based on the Paxos consensus protocol, but its results have not yet been published. While pursuing efficient search functionality on top of the DHT layer, we implemented a simple set functionality. Our algorithm uses a distributed B-tree to store set data over several hosts. This tree algorithm allows us to perform load-balancing on peers while maintaining logarithmic access to data items. We use any generic DHT algorithm as a simple storage layer, accessing data blocks through a simple key-based routing operation. This operation, route(key,message), is capable of delivering a message to the host responsible for the key using the DHT scalable properties. Our presentation will discuss the issues in implementing a tree-based algorithm over a distributed large-scale key-based routing system. We will also discuss how fault-tolerance (through block replication) influences both data consistency and availability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SkyEye.KOM: An Information Management Over-Overlay for Structured P2P Systems

In order to ease the development and maintenance of more complex P2P applications, which combine multiple P2P functionality (e.g. streaming and dependable storage), we suggest to extend structured P2P systems with a dedicated information management layer. This layer is meant to generate statistics on the whole P2P system and to enable capacity-based peer search, which helps the individual funct...

متن کامل

Multi-set DHT for interval queries on dynamic data

Scalability is a fundamental problem for information systems when the amount of managed data increases. Peer to Peer systems are usually used to solve scalability problems as centralized approaches do not scale without large dedicated infrastructure. But most current Peer to Peer systems do not take into account that indexed data can be dynamic and change their values very often. Thus, we propo...

متن کامل

Routing Algorithms for DHTs: Some Open Questions

Even though they were introduced only a few years ago, peer-to-peer (P2P) filesharing systems are now one of the most popular Internet applications and have become a major source of Internet traffic. Thus, it is extremely important that these systems be scalable. Unfortunately, the initial designs for P2P systems have significant scaling problems; for example, Napster has a centralized director...

متن کامل

Complex Queries in DHT-based Peer-to-Peer Networks

Recently a new generation of P2P systems, offering distributed hash table (DHT) functionality, have been proposed. These systems greatly improve the scalability and exact-match accuracy of P2P systems, but offer only the exact-match query facility. This paper outlines a research agenda for building complex query facilities on top of these DHT-based P2P systems. We describe the issues involved a...

متن کامل

Multi-set DHT for Range Queries on Dynamic Data for Grid Information Service

Scalability is a fundamental problem for information systems when the amount of managed data increases. Peer to Peer systems are usually used to solve scalability problems as centralized approaches do not scale without large dedicated infrastructure. But most current Peer to Peer systems do not take into account that indexed data can be dynamic. Thus, we propose the Multi-set approach, which ai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005